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Abhik Ghosh^, Ayanendranath Basu^ 

Indian Statistical Institute 


Abstract 

We present a robust test for composite null hypothesis based on the general S- 
divergence family. This requires a non-trivial extension of the results of Ghosh 
et al. (2015). We derive the asymptotic and theoretical robustness properties 
of the resulting test along with the properties of the minimum S'-divergence 
estimators under parameter restrictions imposed by the null hypothesis. An 
illustration in the context of the normal model is also presented. 

Keywords: Parameter Restriction, Gomposite Hypothesis Testing, 

Robustness, S'-Divergence. 


1. Introduction 


Statistical tests for composite hypotheses are encountered all the time in all 
disciplines of applied sciences. For such composite hypotheses, the null param¬ 
eter space is generally defined through some pre-specified restrictions and one 
needs to estimate the parameter value under those restrictions to perform the 
test. The most common and widely used statistical tool to solve this inferential 


problem is the classical likelihood ratio test (Neyman and Pearson 1928 Wilks 


19381 which utilizes the maximum likelihood estimator of the parameters un¬ 
der given restrictions. However, the non-robust nature of such likelihood based 
solutions under misspecification of models and/or presence of outliers is well- 
known. So, there have been many attempts for developing robust alternative to 
the likelihood ratio test (LRT) with good asymptotic and robustness properties. 


Ghosh et al. (2015) proposed a general family of tests of hypothesis for the 
simple null problem; these tests are based on the family of S'-divergences ([Ghosh 


et al. 2013aI, and extend the idea of Basu et al. (2013a) who considered testing 
of hypothesis based on the density power divergence (Basu et ah, 1998). In 
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the present paper we provide a non-trivial generalization of the Ghosh et al. 


(2015) paper, and the theoretical robustness properties described in this work 
also provide the theoretical underpinnings of the Basu et al. ( 2013b( ) tests as a 
special case. We also study the corresponding minimum divergence estimators 
under the restrictions imposed by the null. 

In this paper, we presents the restricted minimum S'-divergence estimator 
and its asymptotic distribution for both the discrete and continuous models. 
The main focus of the paper is on the theoretical robustness properties of the 
^-divergence based tests of composite hypothesis. For brevity in presentation, 
the proofs of all the results are provided in the online supplement to this paper. 


2. The Restricted Minimum S-Divergence Estimators (RMSDE) 


The ^-divergence family has been recently introduced by Ghosh et al. (2013a) 


and contains several popular density-based divergences like the power divergence 


(PD) family of Cressie and Read (1984) and the density power divergence (DPD) 
family of Basu et al. ( |1998 ). For two densities g and /, it is defined in terms of 
two parameters 7 € [0,1] and A € K as 

Si,M9J) = \ j / /V + l J 5'+^ A ^0,5^0, 

( 1 ) 

where A = 1 + A(1 — 7 ) and R = 7 — A(1 — 7 ). Whenever Al = 0 or R = 
0, the corresponding S'-divergence measure is defined by the continuous limits 
of 0 as a 1 —>• 0 or R —>• 0 respectively. Several properties and applications 
of the (unrestricted) minimum S-divergence estimators have been studied by 
Ghosh et al.| ( Ml3a|b ), Ghosh (2014b), Ghosh and Basu (2014) and Ghosh et 
al. (2015). Here, we consider the minimum S-divergence estimators under some 


pre-specified parameter restrictions and study their asymptotic properties. The 
general theory of robustness for general minimum divergence estimators under 


parameter restrictions has been recently developed by Ghosh (2014a), which 
also contains the case of the S-divergence measures. 


2.1. Definition and Estimating Equation 

Consider the standard set-up of parametric inference, where we have a sam¬ 
ple Xi ,..., Xn from the true density g which is modeled by a parametric family 
of densities R = {/6/:6)G0CIRP}. We assume a set of r restrictions on the 
parameter 9 given by 

H9) = 0, (2) 

such that the p x r matrix defined by H{9) = exists with rank r and is a 
continuous function of 9. 

The restricted minimum S-divergence estimator (RMSDE) of 9 is to be ob¬ 
tained by minimizing S(^a,X){g, fe) subject to the constraints ([^; here g is some 
non-parametric estimator of the true density g\ this is given by the relative fre¬ 
quency of the values in the sample space for discrete models and by some kernel 
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density estimator for continuous models. See Ghosh (2014b I and Ghosh and 


Basu (2014) for corresponding descriptions in the unrestricted case. Then, us- 


irrg the method of Lagrange multipliers, the estimating equation of the RMSDE 
is given by 


jKiS{x))f^+^{x)u8{x)dx + H{e)X,, = 0 \ ,, 

and h{e) = 0 J ’ ^ > 

where 6{x) = 6n{x) = - 1, K{6) = -i] ^ Xn 

is the vector of Lagrange multipliers, and ug = -^ log fg. 


2.2. Asymptotic Distribution under Discrete Models 


First we consider the case of discrete distributions, where both g and fg 
are densities with respect to some counting measure over the support x = 
{0,1, 2, • • • }. We denote the relative frequency at any point x by r„(a;). Then 
the RMSDE can be obtained as the solution of the estimating equation @ with 
g{x) replaced by r„(x) and the integral replaced by the countable sum over the 
support X- 

To prove the asymptotic properties of the RMSDE under this set-up, we 
define Os = arg^ mm >S'(q^a) ( 5 ,/e), the restricted “best fitting parameter” un¬ 


der g. Then, we have the following result under the conditions (SA1)-(SA7) of 
Ghosh (2014b) provided in the Supplementary material. 


Theorem 2.1. Consider the above set-up of discrete models and assume that 
the conditions (SA1)-(SA7) of Ghosh (201 fb) hold with respect to 0o = {0 : 
h{0) = 0} (instead of Q). Then, we have the following: 

(i) There exists a consistent sequence On of roots to the restricted minimum 
S-divergence estimating equations 

(ii) Asymptotically, y/n ~ Np (^0, PgVgPg'j , where the matrices Pg 

and Vg are as defined in Definition 1.1 in the Supplementary material. 


Next consider the particular case of the model density with g = fg^ for 
some 6*0 S 0 satisfying the given restriction ([^. In this case, we have Os = Oq 
and hence y/n {On — Os'^ ^ N ^0, Pai0o)Va{0o)Pa{0o)^ asymptotically, where 


PaiOo) and 3 .re defined in Dehnition 1.1 in the Supplementary material. 

Interestingly the asymptotic distribution of the RMSDE at the model is also 
independent of the parameter A defining the ^-divergence measure — just as 
in the case of the unrestricted MSDE. It also coincides with the asymptotic 
distribution of the restricted minimum DPD estimators as obtained in IBasul 


et al. (2013b) independently. 


3 
















2.3. The Basu-Lindsay Approach for the RMSDE under Continuous Models 
Now we consider the case of continuous models, where the densities g and 
fe are both continuous with respect to some common dominating measure. 
However, there is a clear incompatibility of measures between the data that 
are discrete and the assumed continuous model; hence we need to use kernel 
density estimator in place of g in the estimating equation (|^ which brings in 
several complications like bandwidth selection, curse of dimensionality etc. and 
complicated conditions are needed for the asymptotic results. The approach of 
Basu-Lindsay (Basu and Lindsay 1994) helps us to avoid such complications 
by using convolution of the assumed model also by the same kernel; see [GhosTil 


and Basu (2014) for several advantages of this approach and corresponding 


derivations in the unrestricted case. 

Let us define the kernel density estimator 5 * and the corresponding smoothed 
versions g* and fg of the densities g and fg respectively: 


f 1 

9nix) = I W{x,y,hn)dGn{y) = - W{x,X„hn), 

i—1 

g*{x) = j W{x,y,h) dG{y), and fg{x) = J W{x,y,h) dFg{y), 


where W{x,y,hn) is a smooth kernel function with bandwidth /i„, G„ is the 
empirical distribution function and G, Fg are distribution functions of g and 
fg respectively. Using the Basu-Lindsay approach, the restricted minimum 
divergence estimator is to be obtained by minimizing the ^'-divergence between 
g* and fg, subject to the restriction ([^. Thus, the corresponding estimating 
equation is given by 

!K{6l{x))f^{xf+^ure{x)dx + H{e)\n = 01 . . 

h{e) = 0 / • 

where 5f,{x) = — 1 and ug{x) = V log fg{x). In general, the resulting esti¬ 

mator is not the same as the RMSDE obtained by minimizing S(c,,\){gni fo) 

©o; we denote it as the restricted minimum S'*-divergence estimator (RMSDE*). 
We follow Ghosh and Basu (2014) to derive the asymptotic distribution of the 

RMSDE*. Let 9^ = arg min S(^a,\){g*, fg) be the restricted “best fitting pa- 

fleOo 

rameter” under g. Then we have the following theorem. 

Theorem 2.2. Consider the above set-up of continuous models and assume 
that the conditions (SB1)-(SB7) of Ghosh and Basu (2014), presented in the 
Supplementary material, hold with respect to ©o = {0 : h{9) = 0}. Then, 

(i) There exists a consistent sequence 6>* of roots to the restricted minimum 
S* -divergence estimating equations 

(a) Asymptotically, ^/n (o), - 09 ^ ~ Np(o,Pa,x {09 )Va,\ {g)Pa,\ {0^ )), 

where Pa,\ (09 ) and Uq.a (g) o,re defined in Definition 1.2 in the Supple¬ 
mentary material. 
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Now, for g = fg^ with 0o G Q satisfying (0, we have Qs = Oq and the 
asymptotic distribution of ^/n ^ 0 * — 9q^ is normal with mean 0 and variance 

Pa {So)Va {Oo)Pa {do), where Pa (Oq) and Va (Oq) are as in Definition 1.2 in 
the Supplementary material. Once again, the asymptotic distribution of the 
RMSDE* turns out to be independent of the parameter A at the model. 

Further, if we assume that the kernel used in smoothing is a-transparent for 
the restricted family Pq = {fg : 9 G 0o} C in the sense of Definition 9.1 
of Ghosh and Basu (2014) presented in the Supplementary material (Definition 
1.3), then it follows that for any 9q G Oq, Ja (^o) = Ja{&o): (^o) = Ki(^o) 

and so Pa (9o) = Pai^o)- Hence at the model density g = fg^ G Pq, the 
asymptotic distribution of the RMSDE* becomes exactly the same as that of 
the RMSDE under discrete model. 


3. 5'-Divergence based Test (SDT) for Composite Hypothesis 

Now we consider the problem of testing composite null hypothesis. Under 
the notations of the previous section, take a fixed (proper) subspace ©o of the 
parameter space 0. Our objective is to test for the hypothesis 


Hq : 9 G Qq against Hi : 9 ^ Qq. (5) 

Following Ghosh et al. (2015), the S'-divergence based test (SDT) statistics 
for testing the above hypothesis can be constructed as 

'^7,a(^/5,Ti ^/3.t) = 2uS'(-^_a) (6) 

where 5(.^p,)(-, •) is the S'-divergence measure with parameter 7 and A and 0 / 3 ,t) 
denote the restricted (under ©g) and unrestricted MSDEs with tuning pa¬ 
rameters /3 and r. Their asymptotic distributions at the model are independent 
of T. Indeed the estimators 0 / 3 ,n and 0/3 ,t 2 are asymptotically equivalent in that 

^/ti ^0/3,n 0/3 ,t 2^ t 0, 


for any ti ^ T 2 . We therefore replace 0 / 3 ,and 9^ ^ by 0,a,o and 0 / 3,0 respectively, 
without altering the asymptotic properties of the test statistics in ([^. The 
advantage of this substitution is that the latter set of estimators minimizes the 
DPD, and thus can be evaluated without a^ kernel smoothing. The asymptotic 
properties of the restricted MDPDE 0/3 = 0 / 3,0 at the model g = fgg is given by, 
Basu et al.| ( [2013b[ ), 


see 


V^(0/3 - 0o) 4 iV(0, P/3(0o)H/3(0o)P;3(0o)). 


This distributional convergence holds under Gonditions (D1)-(D5) of |Basu et al. 
(2011 1 with respect to ©o; we refer to these 5 conditions as “Basu et al. condi¬ 
tions” throughout the rest of the paper. We also assume the standard conditions 
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of asymptotic inference, given by Assumptions A, B, C and D of Lehmann (1983 


p. 429); we refer to them as the “Lehmann conditions”. Both set of conditions 
are presented in the supplementary material. 

Now, to explore the asymptotic properties of the proposed (modihed) SDT 


statistics 711 ^ 1 ( 0 / 3 ,0^) = g, ^/ 3 ,o) for testing the composite hypothesis (5), 

we re-define the null parameter space ©g in terms of r restrictions of the form 
1^ . We also assume that the corresponding p x r matrix H(0) = exists 

and it is a continuous function of 9 with rank r. Indeed, this condition can be 
seen to hold for most parametric hypothesis. We start with the asymptotic null 
distribution of the proposed SDT. 




Theorem 3.1. Suppose the model density satisfies the Lehmann and Basu et 
al. conditions with respect to both 0 and ©g and Hq is true with 0 q G ©q being 
the true parameter value. Then, the asymptotic null distribution of the SDT 

statistic T^l^( 6 p, 9 i 3 ) coincides with the distribution ofJ 2 l=i , where 

— — T,/3 

,Z.r o,re independent standard normal variables, Ci (^o)? Cr (do) 
are the nonzero eigenvalues of A~^{9o)'Zg(0o) with 

Mdo) = [Jg^Oo) - P^i9o)m9o)[J^\9o) - Pp(do)] 

and r = rank (Vi 3 ( 0 o)[J)f^( 9 o) - Pi3i9o)]A^(0o)[Jp^ (do) - -P/3(6'o)] Va(6'o)) . 


Noting the similarity of the above asymptotic null distribution with the case 
of testing simple null hypothesis, we can find critical values of the proposed SDT 
following Remark 3 of Basu et al. (2013aI. We can also derive an asymptotic 
power approximation at any point 9* ^ ©g; if 9* ^ ©g is the true parameter 

value then 9^ ^ 9* but 0/3 ^ 0g for some 0g G ©g with 9* 0g. Define ^g(d) = 

Jp^(9)Vp{9)Jp^(9). Then, by an argument similar to the one in the above 
theorem, one can show under the Basu et al. conditions that 


\/n 



S/3(r) Ai2 1\ 

Pp(do)Vp{9o)P0i9o)\)' 


for some p x p matrix A 12 = Ai 2 ( 0 *, 0g). Further define 




M2,^,A(r,0g)= 


Theorem 3.2. Suppose the model density satisfies the Lehmann and Basu et 
al. conditions with respect to both © and ©g and take any d* ^ Oq. An asymp¬ 
totic approximation to the power function of the SDT statistic T^'!,^(9p, 9 13 ) for 
testing at the significance level a is given by 


= 1 - $ f -- ^ ^ f 11, d*^e^ 


(Tp,-^,x(9\9o) \ 2n 
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where ta^ 


sdtt: 


(1)/ 


7. A ^ 


is the (1 — quantile of the asymptotic null distribution of the 
Ip.Ojs) and o/^xiS*, Oo)"^ = +-^i^^,a^i 2 -^ 2 . 7 ,a + 


The above theorem may be proved by a routine application of Taylor series 
and is omitted. This power approximation can help us to obtain the required 
sample size in any planned experiment to achieve a desired power. The theo¬ 
rem also shows that the proposed S'-divergence based test is consistent for the 
composite hypotheses at any 9* ^ Qq. 


4. Robustness of the SDT for Composite Hypothesis 

4.I. Influence Function of the Test 

Let us define the statistical functional corresponding to the proposed SDT 
for the composite hypothesis as 

^7,a(^) = >5'(7,A)(/(7^(G)!/[7^(g))j 

where Up{G) is the MDPDE functional and UplfG) is the restricted MDPDE 
functional under 0o as defined in Ghosh (2014a). Consider the contaminated 


distribution = {l — e)G + e/\y, where Ay is the degenerate distribution at the 
contamination point y and e is the contamination proportion. Then Hampel’s 


1979 


first-order influence function (Hampel et al. 1986, Rousseeuw and Ronchetti 


1981) of the SDT functional is given by 

= M,^,^x{Up{G),Up{G)flF{y-U0,G) 
+ M2,,MU0{G),MG)flFiy;U0,G), 


IF{y;T^%G)= 


e=0 


where IF{y,Up,G) and IF{y;U 0 ,G) are the influence functions (IFs) of 
and U 13 (G) respectively. Now, under the null hypothesis if 9q G ©q is the 
true value of parameter with G = Fg^ then Up{Fg^f) — 0o, Ui 3 (Fg^f) = 9 q and 
-^i, 7 ,A(^o, ^ 0 ) = 0 for all J = 1,2; hence the first-order IF of our SDT statistic 
for the composite hypothesis also becomes zero at the null. 

Therefore, to assess the robustness of the test, we consider the second order 

influence function of our statistic defined as IF 2 (y\T^^l, G) = ^T^^l(H^) 

e=0 

In the particular case G = Fg^ with 6*0 G ©o, this second order influence function 
of the SDT simplifies to 

IF2iy;T^%Fg^) = Dp{y;9o)^A^(9o)Dp(y;0o), 

where Dp(y;0o) = IF{y,Up,Fgfj — IF{y;Ui 3 , Fg^) . Note that the IF of the 
SDT at the composite null is also independent of A and it is bounded if and only 
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if the influence function of the corresponding unrestricted and restricted MDPD 
functionals are both bounded or both diverge at the same rate. However, for 
most parametric models, these IFs of MDPDE and RMDPDE are seen to be 
bounded whenever /3 > 0 but unbounded at /3 = 0. 

4-2. Level and Power Influence Functions 

Now we consider the influence function of level and power of the SDT for 
composite hypothesis. Since the SDT is consistent, its asymptotic power is one 
at any fixed alternative; so we consider the asymptotic power under contiguous 
alternatives Hi^n ■ dn = + ;^ G 0 — 0o with A G — {0} and 6 o G 0o. 

Clearly, to ensure the existence of such a Oq in 0o there must exist a limit 
point 9o of the null parameter space 0o; we assume 0o to be a closed subset of 
0. Next, following Hampel et al. (19861, we also consider the contaminations 


over these contiguous alternatives such that their effect tends to zero as 
tends to Oq at the same rate to avoid confusion between the null and alternative 


3 see|Huber-Carol 1970 

Heritler and Ronchettlf 1994 

Toma 

2011 

1. So, consider the contaminated distributions 

n,e,y 


and Bronlatowski 

and defined in Definition 1.4 of the supplementary material, for level 

and power respectively and the level Influence function (LIF) and the power 
influence function {PIF) as defined therein; also see Ghosh et al. (2015). 

Let us first derive a general expression for asymptotic power P(A, e) = 


lim 

n—>-oo 




FF. 


Ti 


( 1 )/ 


7,A \ 


> t\ 




for testing composite hypothesis under con¬ 


tamination in the following theorem. Here, Xp denote a central chi-square ran¬ 
dom variable with p degrees of freedom and Xp s denote a non-central chi-square 
random variable with degrees of freedom p and non-centrallty parameter S. 

Theorem 4.1. Assume that the Lehmann and Basu et al. conditions hold for 
the model density and the null parameter space 0o is such that there exists a 
limit point 9 q G 0o satisfying 9n = 9o + G 0 — 0o for all A G — {0}. 


Then for any A G 


and € > 0, we have the following: 


(i) The asymptotic distribution ofT}^ {{dp, dp) under Fff^^ y is the same as that 
of the quadratic form W"’"A.y(9{f)W, where W ~ Np ^A*, E,3(0o)^, where 


A* = 


A- 


\lF{y,Up,Fe,) - IF{y,Up,F 0 ,)] 


7,/3 , 


Equivalently, this distribution is the same as that ofJ2i=i Ci (^o)x^ 


where 


Si,. 


= 17 with Vp^.y{9tf) being the 


matrix of normalized eigenvectors of A~^{9 q)'Ep{9q). 


' 7,/3 , 


(a) P(A,e) = x; C4’^{9o,A*)P [xr+2v > , K where C(i)”"(^'o) is 

V Qi) (^ 0 ) / 

the minimum of {9q)s over i = I,...,?* and {do, A) is as Defini¬ 
tion 1.5 in the Supplementary material. 





















Corollary 4.2. Putting e = 0 m above theorem, we get the asymptotic power 
under the contiguous alternatives Hi n : 0 — 9n = Oq + as 


OO 

= P(A,e = 0) = V Cy(0o,A)/" 


i ?=0 


Xr+ 2 'L 


> ta~*K(l) {do) 




Corollary 4.3. Putting A = 0 in above theorem, 
under the probability distribution Fn,e,y 

OO ___ 

5; = P{A = 0,e)=J2 C:i'^ieo,eDp{y,9o))P 

v—0 


we get the asymptotic level 


> fhn 
Ar+2v ^ ‘-a 


/C(ir’^0o) 


Further, if we also take e = 0, then y coincides with the null distribution and 
the asymptotic distribution of the proposed SDT obtained from part (i) of the 
above theorem coincides with asymptotic null distribution obtained independently 


in Theorem 3.1: hence oq = ot, as expected. 


In practice, we can use finite truncation to approximate the infinite series in 
the above theorem, as discussed in Remark 3.1 of Ghosh et al. (2015). 

Finally we will compute the level and power influence function of the pro¬ 
posed ^-divergence based test statistics for c omp osite hypothesis from the ex¬ 


pression of P(A, e) as obtained in Theorem 


3.2 


In particular, the power in¬ 


fluence function (PIF) comes from a simple differentiation of P(A, e) at e = 0 
and then the level influence function {LIF) can be derived just by substituting 
A = 0. The following theorem presents the form of the PIF and LIF of the 
proposed SDT. Clearly, both the LIF and PIF can be seen to be bounded 
whenever the influence function of the MDPDE under the null and overall pa¬ 
rameter space both are bounded or both diverges at the same rate; this in turn 
implies the size and power robustness of the proposed SDT for /3 > 0. 

Theorem 4.4. Assume that the Lehmann and Basu et al. conditions hold for 
the model density and the influence function IF{y\Up of the minimum 

DPD estimator is bounded. Then the power and level influence function of the 
proposed test statistics for composite hypothesis have the forms 

PIF{y-T^l,Fe,) = -P{A,e)l^^ = Dp{y,9o)^C*{A,9^,^,f,a) 

and LIF{y-T%Fe^) = do)^C*(0,0o, 7,/?,«), 


where C*{A,9o,-f,l3,a) = ^ 




PC2’^{9o,t) 




P 


Xr+24f ^ 




5. Example: Testing Normal Mean with unknown variance 

Now, we illustrate the proposed theory in the context of testing the mean of a 
normal distribution with unknown variance; the same problem with known vari¬ 
ance is illustrated in Ghosh et al. (2015). Suppose we have a sample Xi,..., A„ 
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of size n from a population having a univariate normal density with 

both the parameters unknown. Based on this sample, we want to test the hy¬ 
pothesis Hq : /j, = p,o for a pre-specified real number /io; cr is not assumed to 
be known. Suppose Op = {flp, op) is the MDPDE of 0 = (/r, a) with tuning 
parameter /3 and the corresponding restricted MDPDE under the above null is 
Op = (/io, op). Then a simple calculation shows that the proposed SDT for 
testing above Hq is given by 



2nK.y A 
~AB 



- l-B — 1-A 

(pj-jj^^op__op^_^- 

^Bap^ P- Aap^ 




for A, B 7 ^ 0, where = (27r) 2(1 + ^) 2 . However, at A = 0 or — 0, the 
above test statistic is defined in a limiting sense as 




(M/3 - 

- 2 

op 


A = 0, 



(M/3 - 

op J 


B = 0. 


In particular, p = \ = p = 0, kq = I, Oq = {X, the 

unrestricted MLE of 0 and Oq = (/to, ^ ~ the restricted MLE 

of 0 under the null hypothesis. Then, the SDT statistic further simplifies to 


^0,0 (^0,^*0) = »^log 




this is again the likelihood ratio test statistic for the problem under considera¬ 
tion. Thus, the proposed SDT is a robust generalization of the LRT. 

Erom Theorem |3.1[ the asymptotic null distribution of the SDT statistics 

T^^l{0p,0p) for the composite null is also given by the distribution of 

where Zi ~ Xi C 7 ’^ = ^ 7+2 with vp = (/.|^^) 3/2 o^ being the asymptotic 
variance of the MDPDE pp. Note that, this asymptotic null distribution is 
exactly the same as that in case of testing simple hypothesis with known o. 
In fact, all the asymptotic properties and the influence function of the test 
statistics for this case of composite hypothesis turns out to be exactly the same 
as obtained in the case of known 0 (Ghosh et ah, 2015); the main reason behind 
this is the asymptotic independence of the MDPDE or RMDPDE of /t and 0 
under the normal model. 


5.1. A Real data Example: Telephone-fault data 

Consider an interesting real dataset containing the records on telephone line 
faults presented and analyzed by Welch (1987); also studied by Simpson (1989) 
and Basu et al. (2013a|b|. Table ^ presents the data on the ordered differences 
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between the inverse rates of test and control in 14 matched pairs of areas. 
These data could have been modeled by the normal distribution with mean /x 
and standard deviation a if not for the first observation which produces a huge 
outlier with respect to the remaining 13 observations. The presence of this 
outlying observation changes the MLE of the parameters /x and a drastically 
whereas the MDPDE with a slightly larger tuning parameter /3 produces robust 
estimators ( Basu et al.[ 2013a|b[ ); Tablepresents the MDPDE and MLE (it is 
the MDPDE with ^ = 0) of the parameters under the full data and also after 
removing the outlying first observation. 


Table 1: Telephone-fault data 


Pair 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

Difference 

-988 

-135 

-78 

3 

59 

83 

93 

no 

189 

197 

204 

229 

289 

310 


Table 2: MDPDEs of /x and cr for the Telephone-fault data 



/3 

0 

0.05 

0.1 

0.2 

0.5 

Full Data 

M 

40.357 

62.804 

115.435 

125.861 

143.085 


a 

311.332 

273.909 

148.766 

120.105 

96.564 

Outlier 


119.462 

120.844 

122.361 

125.893 

143.085 

Deleted data 

a 

129.532 

127.406 

125.128 

120.009 

96.564 


For the present data set we consider the problem of testing two different 
hypothesis on the mean parameter /x, namely : /x = 0 and Hq : ^ = 115 
against their respective omnibus alternative. We consider the two cases of known 
and unknown tr; for the known a case we use its robust estimator 132 for the 
value of cr as suggested by Basu et al. ( 2013a|b |. However, due to the non-robust 
nature of the MLE, the likelihood ratio test (and equivalently the traditional 
z-test or t-test) fails to reject the null Hq but rejects the null Hq due to the 
presence of the large outlier; on the other hand after the removal of the large 
outlier the same test fails to reject the null Hq but soundly rejects Hq. Here 
we apply the proposed S'-divergence based test for these two testing problems 
using the formulas given in Ghosh et al. (2015) and in the present paper for 
known and unknown cr respectively. The p-values obtained by the S'-divergence 
based tests (SDT) for the full data are presented in Figure along with that 
corresponding to LRT. The robust nature of the proposed SDT is clear for both 
the cases. 
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